Method and system for URL autocompletion using ranked results

ABSTRACT

A set of ordered predicted completion strings are presented to a user as the user enters text in a text entry box (e.g., a browser or a toolbar). The predicted completion strings can be in the form of URLs or query strings. The ordering may be based on any number of factors (e.g., a query&#39;s frequency of submission from a community of users). URLs can be ranked based on an importance value of the URL. Privacy is taken into account in a number of ways, such as using a previously submitted query only when more than a certain number of unique requesters have made the query. The sets of ordered predicted completion strings is obtained by matching a fingerprint value of the user&#39;s entry string to a fingerprint to table map which contains the set of ordered predicted completion strings.

RELATED APPLICATIONS

This application is a continuation of U.S. patent application Ser. No.10/987,294, filed Nov. 11, 2004 now U.S. Pat. No. 7,499,940, which isincorporated by reference herein in its entirety.

This application is related to “Anticipated Query Generation andProcessing in a Search Engine,” U.S. patent application Ser. No.10/875,143, filed Jun. 22, 2004, which is incorporated by referenceherein in its entirety.

TECHNICAL FIELD

The present invention relates generally to the field of search enginesfor locating documents in a computer network (e.g., a distributed systemof computer systems), and in particular, to a system and method forspeeding up a desired search by anticipating a user's request.

BACKGROUND

Search engines provide a powerful tool for locating documents in a largedatabase of documents, such as the documents on the World Wide Web (WWW)or the documents stored on the computers of an Intranet. The documentsare located in response to a search query submitted by a user. A searchquery may consist of one or more search terms.

In one approach to entering queries, the user enters the query by addingsuccessive search terms until all search terms are entered. Once theuser signals that all of the search terms of the query have beenentered, the query is sent to the search engine. The user may havealternative ways of signaling completion of the query by, for example,entering a return character, by pressing the enter key on a keyboard orby clicking on a “search” button on a graphical user interface. Once thequery is received by the search engine, it processes the search query,searches for documents responsive to the search query, and returns alist of documents to the user.

Because the query is not sent to the search engine until the user hassignaled that the query is complete, time passes while the user isfinishing the full search query. It would be desirable to have a systemand method of speeding up this process.

SUMMARY

In one embodiment, a system and method for processing query informationincludes receiving query information from a search requestor, prior tothe user indicating completion of the query. From the received queryinformation a set of predicted queries ordered in accordance with aranking criteria is predicted. The set of ordered predicted queries isthen transmitted to the search requester.

The search requester may select a respective query from the ordered setof predicted queries and then indicate completion of the query. A searchengine processes the query to produce a set of search results.Alternately, the search requester may continue entering queryinformation until a complete query is entered, or until a new set ofpredicted queries is transmitted and presented to the search requester.

BRIEF DESCRIPTION OF THE DRAWINGS

The aforementioned embodiment of the invention as well as additionalembodiments will be more clearly understood as a result of the followingdetailed description of the various aspects of the invention when takenin conjunction with the drawings. Like reference numerals refer tocorresponding parts throughout the several views of the drawings.

FIG. 1 depicts a process for predicting queries in accordance withembodiments of the present invention.

FIG. 2 depicts a block diagram of a search system in accordance withembodiments of the present invention.

FIG. 3 depicts a process in a search assistant in accordance withembodiments of the present invention.

FIG. 4 depicts a process for receiving query input and creatingresponses thereto in accordance with embodiments of the presentinvention.

FIG. 5 depicts flows of information associated with creating and using afingerprint-to-table map in accordance with embodiments of the presentinvention.

FIG. 6 depicts examples of relevancy of input strings in accordance withembodiments of the present invention.

FIG. 7 depicts a process for processing historical queries in accordancewith embodiments of the present invention.

FIG. 8 depicts a portion of an exemplary table used in processinghistorical queries in accordance with embodiments of the presentinvention.

FIG. 9 depicts data structures associated with a query completion tableusing suffixes in accordance with embodiments of the present invention.

FIG. 10 depicts a portion of an exemplary query completion table inaccordance with embodiments of the present invention.

FIG. 11 depicts an exemplary screen shot in accordance with embodimentsof the present invention.

FIG. 12 depicts a search engine suitable for implementing embodiments ofthe present invention.

FIG. 13 depicts a client suitable for implementing embodiments of thepresent invention.

DESCRIPTION OF EMBODIMENTS

In one embodiment of the invention, portions of a user's query aretransmitted to a search engine before the user has finished entering thecomplete query. The search engine uses the transmitted portion of thequery to predict the user's final query. These predictions aretransmitted back to the user. If one of the predictions is the user'sintended query, then the user can select that predicted query withouthaving to complete entry of the query. In some embodiments, the selectedquery is transmitted to the search engine, which returns a set of queryresults corresponding to the selected query.

FIG. 1 illustrates an exemplary embodiment of the invention including aclient system 104 and a search engine 106. As a user enters a searchquery, the user's input is monitored by the client system (108). Priorto the user signaling completion of the search query, a portion of theuser's query is sent from the client system 104 to the search engine 106(110). The portion of the query may be a few characters, a search term,or more than one search term. In some embodiments, the partial input isin the form of a uniform resource locator (URL) such as that describedin RFC 1738, promulgated by the Internet Engineering Task Force, whichcan be used to identify resources within computers and computernetworks. The search engine 106 receives the partial query forprocessing (112) and makes predictions as to user's contemplatedcomplete query (or URL) (114). The predictions are ordered according inaccordance with a ranking criteria. For example, in some embodimentsqueries having a higher frequency of submission are ordered beforequeries having lower frequencies of submission. The search engine 106uses a number of query completion tables (described in more detailbelow) to assist in making the ordered predictions. The query completiontables are created using previously entered search queries received bythe search engine 106. In some embodiments, the previous queries includesearch queries from a community of users. The predicted queries are sentback to the client system 106 (116) and then presented to the user(118). If one of the predicted queries is what the user intended as thedesired query, the user may select this predicted query and proceedwithout having to finish entering the desired query. If the predictedqueries do not reflect what the user had in mind, then the user maycontinue entering the desired search query.

FIG. 2 illustrates a searching system 200 according to some embodimentsof the invention and shows various functional components which will bereferred to in the detailed discussion which follows. The search system200 may include one or more client systems 202. Each client system 202has a search assistant 204. The client systems 202 are connected to acommunications network 206. The communications network 206 connects theclient systems 202 to a search engine 208. Search engine 208 includes aquery server 210 connected to the communications network 206, aprediction server 212, a URL database 225 and a query processingcontroller 214.

The query server 210 includes a client communications module 216, aquery receipt, processing and response module 218, a partial queryreceipt, processing and response module 220, a user informationprocessing module 222, and a query log 224, all interconnected. In someembodiments, fewer and/or additional modules or functions are includedin the query server 210. The modules shown in FIG. 2 as being part ofquery server 210 represent functions performed in an exemplaryembodiment. The prediction server 212 is connected to partial queryreceipt, processing and response module, the URL database 225 and toquery log 224.

The query processing controller 214 is connected to an inverse documentindex 228, a document database 230, a query cache 232 and the URLdatabase 225. The cache 232 may include an index 234 the function ofwhich is to locate entries in the cached results 236. The cached results236 may include a cache entry for an identified query 238 and a cacheentry for an anticipated query 240. The inverse document index 228 anddocument database 230 are sometimes collectively called the documentdatabase. In some embodiments, “searching the document database” meanssearching the inverse document index 228 to identify documents matchinga specified search query or term.

Although illustrated as discrete blocks in the figure, FIG. 2 isintended more as a functional description of an embodiment of theinvention rather than a structural mapping of the functional elements.One of ordinary skill in the art would recognize that an actualimplementation might have the functional elements grouped or split amongvarious components. For example, the query log 224 may be distinct fromthe query server 210. In some embodiments the query log 224 may bestored on one or more servers whose primary function is to store andprocess query log information. Similarly, the URL database 225 may bestored on or more servers whose primary purpose is to store and processinformation about known URLs.

FIG. 3 illustrates an embodiment of the invention that may beimplemented in the search assistant 204 of a client system 202 (FIG. 2).The search assistant 204 monitors the user's entry of a search query onthe client system 104 (302). In some embodiments, the search assistant204 monitors the user's entry of a uniform resource locator (URL) inputstring, such as in the address field of a browser window. The user mayenter the search query or URL in a number of ways including a browserwindow, a search tool, or any other input mechanism. The searchassistant 204 may identify two different scenarios. First, the searchassistant 204 receives or identifies a final input (302-final input)when the user has indicated completion of the input string or selected apresented prediction. Second, the search assistant 204 receives oridentifies a partial input (302-partial input) when an input isidentified prior to when the user indicates completion of the inputstring (as described below). In a third, optional scenario (described inmore detail below), the search assistant 204 determines or receivesnotification that the user has not selected one of the predictionswithin a specified time period.

When a final input or selection (302-final input) is identified as asearch query, the input is transmitted to the search engine 208 (304)for processing. The search engine 208 returns a set of search results,which is received by the search assistant 204 (306) or by a clientapplication, such as a browser application. The list of search resultsis presented to the user such that the user may select one of thedocuments for further examination (e.g., visually or aurally). When thefinal input is a URL, the request is transmitted to the appropriatedocument host (304) and the document, if available, is returned (306).After the response is received (306), the user's input activities areagain monitored (302). In some embodiments, the URL request is sent tothe search engine 208 for logging and the request is redirected to theappropriate document host.

A final input may be identified by the search assistant 204 in a numberof ways such as when the user enters a carriage return, or equivalentcharacter, selects a search button in a graphical user interface (GUI)presented to the user during entry of the search query, or by possiblyselecting one of a set of possible queries presented to the user duringentry of the search query. One of ordinary skill in the art willrecognize a number of ways to signal the final entry of the searchquery.

Prior to the user signaling a final input, a partial input may beidentified (302-partial input). A partial input may be identified in anumber of ways. For a search query, a partial input includes a singlesearch term of the search query, multiple search terms, or a predefineda number of characters of a search term.

In some embodiments, a partial input is identified by detecting entry ofdelimiter or other character (e.g., without limitation, a quotecharacter, a period, a parenthesis character, a slash character, arrowkey detection or tab entry). Entry of a delimiting character mayindicate that a user has finished entering a desired term or portion ofthe input and is moving onto the next search term or portion.

In some embodiments, a partial input is identified by detecting entry ofa pre-determined number of characters. In these embodiments, the inputcontains a number of characters less than a full input but it may stilldesirable to identify the partial input before the user has entered allof the characters. This technique is desirable, for example, when thesearch term or URL contains a large number of characters or when thepre-determined number of characters is large enough to result in usefulpredictions.

In some embodiments, a partial input is identified by detecting theabsence of a character being entered within a period of time, theabsence representing a pause by the user. The pause may signify that theuser has entered one search term or portion of the complete string buthas not entered the space key (or other delimiting character) to startentering another term or signify that the search query is in factcomplete but the user has not yet so signaled.

Regardless of the way in which the partial input is identified, it istransmitted to the search engine 208 (308) for processing. In responseto the partial search query, the search engine 208 returns a set ofordered predicted search queries and/or URLs (310) which is presented tothe user (312) ordered in accordance with a ranking criteria. In someembodiments, the predicted search queries are ordered in accordance witha frequency of submission by a community of users. In some embodiments,the search queries are ordered, at least in part, in accordance with alast time/date value that the query was submitted. In some embodiments,the search queries are ordered in accordance with personalizationinformation, such as user personalization information or communityinformation. For instance, user personalization information may includeinformation about subjects, concepts or categories of information thatare of interest to the user. The user personalization information may beprovided directly by the user, or may be inferred with the user'spermission from the user's prior search or browsing activities, or maybe based at least in part on information about a group associated withthe user or to which the user belongs (e.g., as a member, or as anemployee). The set of predicted search queries may be initially orderedin accordance with a first ranking criteria, such as predefinedpopularity criteria, and then reordered if any of the predicted searchqueries match the user personalization information of the user so as toplace the matching predicted search queries at or closer to the top ofthe ordered set of predicted search queries.

One skilled in the art will recognize a number of ways to present thepredicted search queries and/or URLs to the user. For example, thepredicted search queries and/or URLs might be presented in a drop downmenu. Regardless of the manner in which the predicted queries and/orURLs are presented to the user, the user may select one of the queriesand/or URLs if the user determines that one of the predictions matchesthe intended entry. In some instances, the predictions may provide theuser with additional information which had not been considered. Forexample, a user may have one query in mind as part of a search strategy,but seeing the predicted results causes the user to alter the inputstrategy. Once the set is presented (312), the user's input is againmonitored. If the user selects one of the predictions (302-final), therequest is transmitted either to the search engine 208 as a searchrequest or to a resource host as a URL request (304), as applicable.After the request is transmitted, the user's input activities are againmonitored (302). As mentioned above, in some embodiments, the URLrequest is transmitted to search engine 208 for logging purposes.

If, on the other hand, the user has not selected one of the predictionswithin a specified time period, then it is likely that the user did notfind a satisfactory prediction in the predictions that were initiallyreturned. For example, a user's intended input did not have a highenough ranking value to be included in the set of ordered predictions.Accordingly, in some optional embodiments, if the user has not selectedone of the predictions within a specified period of time (e.g., 5 or 10seconds) (302-timeout), then a request is sent to the search engine 208for another set of predictions (318). The subsequent set of predictionscould include predictions having ranking values lower than the setpreviously submitted. Alternately, a second set of criteria may be usedto identify predictions in the second set, where the second set ofcriteria are different than a first set of criteria used to select andrank the first set of predictions. For instance, one of the two sets mayuse selection criteria that takes into account personal informationabout the requester while the other set does not.

In some embodiments, the search engine 208 may optionally returnpredicted results (320). This activity may overlap with receiving thepredictions (310) and is indicated by the dashed line to 320 in FIG. 3.The predicted results are presented (320) and the monitoring of the userresumes (302). The predicted results correspond to documents orinformation that would have been returned based on the request being oneor more of the predicted queries or URLs. In some embodiments, theresults are search results based on one or more of the predictedqueries. For example, in some embodiments, the results presented (320)may be one or more documents relevant to one or more of the predictedqueries or predicted URLs. Accordingly, the user may have predictedresults presented that match a desired request before the user finishesentering the request (e.g., search request or URL request). In suchsituations, the processing latency as viewed by the user is effectivelyreduced to less than zero because the user did not have to complete theinput to obtain the desired result.

FIG. 4 illustrates the activity occurring in the search engine 208 whenit receives an input according to some embodiments. The search engine208 receives the input and determines whether the input indicates afinal input or a partial input (402). If the search engine 208determines that the received input is a final query (402-final query)then it determines whether search results relevant to the query arepresent in the cache 232 (404). If the relevant search results are inthe cache 232 (404-yes), then those results are returned to the client104 (406). On the other hand, if the search results are not in the cache(404-no), then search results relevant to the query are obtained (408),and then returned to the client 104 (406). In some embodiments, a URLrequest, when complete, is not received by the search engine 208 becausethe search assistant sends the request to the resource host. In someembodiments, the URL request is received by the search engine 208 fortracking purposes (such as storage in a URL database) and the request isredirected to the resource host by the search engine 208.

If the search engine 208 determines that the received input was apartial input (402-partial), then it determines a set of ordered matchesthat correspond to the partial input (410), and transmits the set to theclient 104 (412). As will be explained below, in some embodiments, theset of ordered matches sent to the client 104 is one of manypre-computed sets of ordered matches. Although the following operationsare described in terms of a partial query, the same techniques areequally applicable to partial inputs of URLs. In some embodiments, theset of ordered matches returned is relevant only to queries. In someembodiments, the set of ordered matches is relevant to only URLs. And,in some embodiments, the set of ordered matches is relevant to bothqueries and URLs.

To aid in understanding how, according to some embodiments, the searchengine 208 determines which set of ordered matches to return, it ishelpful to begin with a description of how the ordered sets are createdand used. FIG. 5 shows a set of data structures associated withhistorical queries (i.e., queries previously submitted) used forpredicting queries corresponding to partially entered queries. A searchengine or user input prediction system may also include a parallel setof data structures associated with historical URLs (i.e., URLspreviously submitted) used for predicting URLs corresponding topartially entered URLs.

Referring to FIG. 5, a historical query log 502 is filtered by one ormore filters 504 to create an authorized historical queries list 506. Anordered set builder 508 creates one or more fingerprint-to-table maps510 from the authorized historical queries list 506 based on certaincriteria. When the partial query is transmitted (FIG. 3, 308), it isreceived at the search engine 208 as partial query 513. A hash function514 is applied to the partial query 513 to create a fingerprint, i.e., ab-bit binary value (e.g., a 64-bit number). An applicablefingerprint-to-table map 510 (e.g., 510-1) is searched for thefingerprint (e.g., 515) to identify a query completion table 516associated with the fingerprint. An applicable fingerprint-to-table map510 may be selected based on a number of different factors associatedwith a user or a request. Information used to select the applicablefingerprint-to-table map 510 could come from profile informationprovided by the user or the search assistant 204, information gleanedfrom the request itself (e.g., language), information associated withthe user in user information processing module 222, or other sources.The query completion table 516 provides an ordered set of predictedqueries relevant to the partial query 513.

In some embodiments, some preprocessing occurs to the partial querybefore the fingerprint is created. In one embodiment, conspicuouslymisspelled words in the partial query are identified and corrected bycomparing one or more of the complete search terms with entries in adictionary. One or more predicted results from queries including thecorrectly spelled word are merged with the predicted results returned tothe user. In another example, common prefix information could be removed(e.g., “http://” or “www.”). In some embodiments, the terms in the queryare analyzed to extract concepts embodied in the search terms indicatinga particular category of information (e.g., “technology, “food”, “music”or “animals”). One or more predicted results from queries related to oneor more of the extracted concepts are merged with the predicted resultsreturned to the user.

The historical query log 502 contains a log of previously submittedqueries received by the search engine 208 over a period of time. In someembodiments, the queries are from a particular user. In someembodiments, the queries are from a community of users sharing at leastone similar characteristic such as belonging to the same workgroup,using the same language, having an internet address associated with thesame country or geographic region, or the like. The selection of thecommunity determines the pool of previously submitted queries from whichthe predictions are drawn. Different communities would tend to producedifferent sets of predictions.

The historical query log 502 may also contain information associatedwith each submitted query. In some embodiments, the query informationincludes the date and time that the query was submitted or received. Insome embodiments, the query information includes the internet protocol(IP) address from where the query was submitted. In some embodiments,the query information contains a unique source identifier for the query(e.g., a value from a cookie stored on the user's machine where thevalue is associated with a particular search assistant 204). While theunique identifier does not directly identify any particular user, it maybe associated with a particular installation of a browser or toolbar. Insome embodiments, a user may permit direct identification with theunique identifier for certain personalization features which could beaccessed using user information processing module 222.

In some embodiments, a fingerprint value is associated with the query.The fingerprint value may be calculated by applying a hash function tothe query string. In some embodiments, other types of meta-data areassociated and stored with the query such as the query language or otherinformation which might be provided by the user or search assistant inaccordance with user preferences (e.g., identification or profileinformation indicating certain preferences of the user). In someembodiments, the meta-information includes category or conceptinformation gleaned from analyzing the terms in the query. The period oftime over which the queries are logged is a variable and represents atradeoff between storage capacity and potential accuracy of thepredictions. It is likely that longer periods of time will moreaccurately reflect a query's popularity over the entire community,however, this requires more storage. On the other hand, a popularityranking over a long period of time may not reflect a transientpopularity for current events.

One or more filters 504 are used to determine queries authorized forfurther processing. For example, filters can eliminate certain queriesbased on various criteria. In some embodiments, a privacy filter 504prevents queries which have not been received from more than a certainnumber of unique submitters to be included in the authorized historicalqueries list 506. This could be accomplished by examining the uniqueidentifier associated with each query, if one exists, and identifyingonly those queries which have been submitted by at least n uniquesubmitters, where n is a number chosen based on privacy concerns (e.g.,three or five unique submitters). In some embodiments, the filters 504include a filter that eliminates queries which are infrequentlysubmitted and therefore not likely to be selected by a user. In someembodiments, the filters 504 include an appropriateness filter 504 thatblocks certain queries from inclusion based on a number of differentfactors such as the presence of one or more particular keywords in aquery, and/or based on the content of the search results or documentsthat correspond to the query. Other types of filters could be easilyimagined. For example, a filter could block queries submitted earlierthan a particular historical point in time, such that the authorizedhistorical queries list 506 represent recently submitted queries. Whatis considered recent depends on the embodiment (e.g., hours, days,weeks, months, or years). In yet another example, an anti-spoofingfilter 504 could be use to prevent the query/URL prediction system frombeing spoofed by a large number of a artificially generated queries orURL submissions. For instance, an anti-spoofing filter 504 might filterout multiple submissions of the same query or URL received from the sameuser or from the same client computer.

After the historical query log 502 has been filtered by the one or morefilters 504, the result is the authorized historical queries list 506,i.e., a list of queries eligible to be returned to the user as suggestedquery completions. The authorized historical queries list 506 includeshistorical query 506-1 to historical query 506-q, where q represents thenumber of queries included in the authorized historical queries list506. The value of q could be equal to or less than the total number ofqueries filtered from the historical query log 502. For example,filtered queries having frequencies less than a predetermined thresholdcould be ignored. In some embodiments, a new authorized historicalqueries list 506 is built periodically such as hourly, nightly, weeklyor other periods. In some embodiments, the current authorized historicalqueries list 506 is updated based on recent entries to the query log224, after applicable filtering.

Each query in authorized historical queries list 506 (e.g., 506-1)includes the query, its frequency and, optionally, meta-information. Thequery could be a string of characters. The frequency informationindicates how many times the query was submitted over a period of time.As mentioned above, a unique identifier may be used to count the numberof times unique searchers submitted the query. Because different usersmay use multiple search assistants or some queries may not include aunique identifier, the frequency number may not represent the actualnumber of unique users submitting the search query. Nonetheless, aquery's frequency can act as a proxy for a query's popularity. In someembodiments, the authorized historical queries list 506 is orderedalphabetically based on the query. In other embodiments, the authorizedhistorical queries list 506 is ordered based on the query frequency.

The meta-information, may include information similar to themeta-information discussed above in reference to the historical querylog 502 (e.g., location or language information). In some instances, thesame query will have entries in the historical query log 502 whichdiffer not in the query string, but in the meta-information.Accordingly, the meta-information for a particular authorized historicalquery 506-1 may indicate differing meta-information for the same query.For example, the meta-information for a query submitted from twodifferent locations, such as Europe or Asia, would indicate bothlocations as a source location of the query. The meta-information couldalso indicate user profiling information to indicate what types of usershad submitted the query. One of ordinary skill in the art will recognizevarious types of meta-information that might be useful to categorize orgroup queries related by common set of characteristics (e.g., languageor location). In some embodiments, the query terms are analyzed andassociated with certain categories of information. For example, a searchquery including “dog” and “breed” is associated with a “dog” or “animal”category. The meta-information in some embodiments, contains thiscategory information. In some embodiments, meta-information for a singleentry in the authorized historical queries list 506 is produced from themultiple queries, for example, by providing the date/time of the queryas the last date/time value that the query was submitted.

The ordered set builder 508 uses the authorized historical queries list506 to build a set of fingerprint-to-table maps 510-1 to 510-t, where trepresents the number of fingerprint-to-table maps 510 built. Any numberof fingerprint-to-table maps 510 could be built depending on the numberof ways desired to categorize predicted queries. Each of thefingerprint-to-table maps 510 contain sets of ordered predictions eachmapped to a particular partial query. The fingerprint-to-table maps 510differ based on characteristics of information such as might be found inthe meta-information. For example, there may be one fingerprint-to-tablemap 510 for each language (e.g., one for English language queries; oneof French language queries; one for Japanese language queries).Similarly, different fingerprint-to-table maps 510 could be created forgeographical regions. As another example, different fingerprint-to-tablemaps 510 could be created from queries from particular IP addresses orgroups of addresses, such as those from a particular network or aparticular group of individuals (e.g., a corporation). Using themeta-information to create different fingerprint-to-table maps 510,allows the predictions to be based on users having characteristicssimilar to that of the searcher and which should increase the likelihoodof a correct prediction. In some embodiments, differentfingerprint-to-table maps 510 are based on different ranking criteriafor the queries (e.g., frequency, last date/time, personalizationcategories or characteristics, and so on). In some embodiments,different fingerprint-to-table maps 510 are based on the type of userinput (i.e., query string or URL).

Using fingerprint-to-table map 510-1 as an example, each of thefingerprint-to-table maps 510 includes a number of entries 512-1 to512-f, where f represents the number of entries in thefingerprint-to-table map 510-1. The number of entries in any particularfingerprint-to-table map 510 depends on the number of different partialqueries for which the prediction server 212 will return predictions.

Each of the entries in the fingerprint-to-table map 510-1 (e.g., 512-2)includes a fingerprint (e.g., fingerprint (2) 515) and a querycompletion table (e.g., query completion table (2) 5 16). Thefingerprint-to-table maps 510 serve to associate fingerprints (e.g.,fingerprint (2) 515) to query completion tables (e.g., query completiontable (2) 516)).

The fingerprint (2) 515 represents a fingerprint value for a partialquery. The fingerprint (2) 515 may be calculated, for example, byapplying a hash function to a partial query to create a b-bit binaryvalue (e.g., a 64-bit number). Accordingly, the fingerprint-to-table map510-1 may be searched for a fingerprint which matches the fingerprint ofthe partial query 513 (e.g., fingerprint 515).

The query completion table (2) 516 contains a list of query completionfingerprints 518-1 to 518-n, where n represents the number of querycompletion fingerprints in the query completion table (2) 516. In someembodiments, n represents the number of predicted queries returned tothe search assistant 204 (e.g., 10 predicted queries). In otherembodiments, less than n are returned. In some embodiments, n is greaterthan the number of results to be returned in a set of ordered queries.In some embodiments, n is twice the number to be returned and the firstn/2 are provided as a first set of ordered predicted queries and thesecond n/2 are provided as a subsequent set of ordered predicted queries(e.g., the second set of 10 predicted queries is sent subsequent to thefirst set of 10 upon certain conditions). In some embodiments, the querycompletion table 516 includes a score for each query completionfingerprint 518. The scores are used to order the items in the querycompletion table 516, in descending score order. In some embodiments,the scores are a permanent part of the query completion table, while inother embodiments the scores are deleted or not kept after the formationof the query completion tables 516 is completed.

Each query completion fingerprint 518 is a fingerprint value associatedwith a complete query. The query completion fingerprint 518 (e.g.,518-2) maps to an associated query record 520. The query record 520includes a query string 522 which contains the query string for thecomplete query. This approach facilitates entries in multiple querycompletion tables 516 referencing the same query string 522, yet onlyrequiring that the actual query string be stored in a single location(e.g., query string 522). In some embodiments, however, the querystrings 522 may be stored in place of the query completion fingerprints518 in a query completion table 516. In some embodiments, query record520 for URL strings include a URL title 524 representing a titleassociated with the URL. In some embodiments, additional informationassociated with a URL is provided in information 526.

In some embodiments, the query completion table 516 is an ordered listof n queries relevant to the partial query associated with thefingerprint 515. The list may be ordered in accordance with variousranking criteria such as frequency, date/time of submission, and so on.In some embodiments, the ranking criteria may take into account two ormore factors, such as both frequency and date/time or submission, bygenerating a score or rank for each query that takes into account eachof the two or more factors. In a simple example, historical querieswhose date/time is more than 24 hours in the past may contribute a valueof “1” to the ranking score of the query, while historical queries whosedate/time is within the last 24 hours may contribute a value of “2” tothe ranking score of the query. In this example, recent historicalqueries are weighted more heavily than older historical queries indetermining the rank of each authorized historical query.

In some embodiments, the ordered set builder 506 creates or updates thefingerprint-to-table maps 510 and associated query completion tables 516and/or 910 (FIG. 9) periodically (e.g., hourly, daily, weekly) so as tokeep the query and/or URL predictions produced by the prediction serverconsistent with queries and/or URLs recently submitted by the applicablecommunity of users.

Referring to FIG. 6, a partial query of “ho” 602 might have a set ofcompleted queries 604 as being relevant to the partial query 602. Thefirst position of the set of completed queries 604 includes the queryhaving the highest frequency value (e.g., “hotmail”), it is followed inthe second position with the query having the next highest frequencyvalue (e.g., “hot dogs”), and so on. In this example, a complete query'srelevancy to a given partial query is determined by the presence of thepartial query at the beginning of the complete query (e.g., thecharacters of “ho” begin the complete queries of “hotmail” and “hotelsin San Francisco”). In other embodiments, the relevancy is determined bythe presence of the partial query at the beginning of a search termlocated anywhere in the complete query, as illustrated by the set ofcompleted queries 606 (e.g., the characters “ho” are found at thebeginning of “hotmail” and at the beginning of the second search term in“cheap hotels in Cape Town”).

To create the set of query completion tables 516, one of the queries inthe authorized historical queries 506 is selected (FIG. 7, 702). In someembodiments, only queries having the desired meta-information areprocessed (e.g., queries in the English language). The first partialquery is identified from the selected query (704). In one embodiment,the first partial query is the first character of the selected query(i.e., “h” for a query string of “hot dog ingredients”). In someembodiments, preprocessing is applied before partial queries areidentified (e.g., stripping off “http://” or “www.”). An entry is madein a table which indicates the partial query, the complete querycorresponding to the partial query and its frequency. In otherembodiments, other information which is used for ranking is stored(e.g., date/time values, or a ranking score computed based on two ormore factors). If the partial query does not represent the entire query,then the query processing is not complete (708-no). Accordingly, thenext partial query is identified (710). In some embodiments, the nextpartial query is identified by adding the next additional character tothe partial query previously identified (i.e., “ho” for a query stringof “hot dog ingredients”). The process of identifying (710) and ofupdating of a query completion table (706) continues until the entirequery is processed (708-yes). If all of the queries have not yet beenprocessed (712-no), then the next query is selected and processed untilall queries are processed (712-yes). In some embodiments, as items areadded to a query completion table, the items are inserted so that theitems in the table are ordered in accordance with the rank or score. Inanother embodiment, all the query completion tables are sorted at theend of the table building process so that the items in each querycompletion table are ordered in accordance with the rank or score of theitems in the query completion table. In addition, one or more querycompletion tables may be truncated so that the table contains no morethan a predefined number of entries.

Referring to FIG. 8, an exemplary processing of the first fivecharacters of the query string of “hot dog ingredients” is illustratedin table 802 at 804 through 812. An exemplary processing of the firstfour characters of the query string of “hotmail” is illustrated at 814through 820.

In some embodiments, a query completion table for a given partial queryis created by identifying the n most frequently submitted queriesrelevant to the given partial query from the table and placing them inranked order such that the query having the highest rank (e.g., thehighest ranking score or frequency) is at the top of the list. Forexample, a query completion table for the partial query “hot” wouldinclude both complete query strings of 808 and 818. When the ranking isbased on frequency, the query string for “hotmail” would appear abovethe query string for “hot dog ingredients” because the frequency of thequery string in 818 (i.e., 300,000) is larger than that of the querystring in 808 (i.e., 100,000). In some embodiments, a URL's popularitycould be given a value assigned to a particular web page providing anindication of its importance among a set of web pages (e.g., PageRank).Accordingly, when the ordered set of prediction is returned to the user,the queries having a higher likelihood of being selected are presentedfirst. As mentioned above, other values could be used for ranking drawnfrom the meta-information (e.g., date/time values, or personalizationinformation).

Referring to FIGS. 9 and 10, in some embodiments the number of querycompletion tables is reduced by dividing the historical query stringsinto “chunks” of a predefined size C, such as 4 characters. The querycompletion tables for partial queries of length less than C remainunchanged. For partial queries whose length is at least C, the partialquery is divided into two portions: a prefix portion and a suffixportion. The length of the suffix portion, S, is equal to the length ofthe partial query (L) modulo C:S=L modulo C.where L is the length of the partial query. The length of the prefixportion, P, is the length of the partial query minus the length of thesuffix: P=L−S. Thus, for example, a partial query having a length of 10characters (e.g., “hot potato”), would have a suffix length S of 2 and aprefix length P of 8 when the chunk size C is 4.

When performing the process shown in FIG. 7, step 706, identifying orcreating a query completion table corresponding to a partial query isconceptually illustrated in FIG. 9. FIG. 9 schematically illustrates theprocess used both for generating query completion tables as well as forlookup when processing a user entered partial query. When the length ofthe partial query is less than the size of one “chunk”, C, the partialquery is mapped to a query fingerprint 515, for example by using a hashfunction 514 (FIG. 5). The fingerprint 515 is mapped to a querycompletion table 516 by a fingerprint to table map 510, which in turncontains query completion fingerprints 518 or pointers to a set of queryrecords 520 (which contain query strings 522, FIG. 5).

When the length of the partial query is at least the size of one chunk,C, the partial query 902 is decomposed into a prefix 904 and suffix 906,whose lengths are governed by the chunk size, as explained above. Afingerprint 908 is generated for the prefix 904, for example by applyinga hash function 514 to the prefix 904, and that fingerprint 908 is thenmapped to a “chunked” query completion table 910 by a fingerprint totable map 510. The structure of the chunked query completion table 910is different from the query completion table 516 shown in FIG. 5, inthat each entry 911 of the chunked query completion table 910 has asuffix entry 914 as well as a query completion fingerprint 912. Eachentry 911 may optionally include a score 916 as well, used for orderingthe entries in the query completion table 910. The suffix has a length,S, which can be anywhere from zero to C-1, and comprises the zero ormore characters of the partial query that are not included in the prefix904. In some embodiments, when generating the query completion tableentries 911 for a historical query, only one entry is made in eachchunked query completion table 910 that corresponds to the historicalquery. In particular, that one entry 911 contains the longest possiblesuffix for the historical query, up to C-1 characters long. In otherembodiments, up to C entries are made in each chunked query completiontable 910 for a particular historical query, one for each distinctsuffix.

FIG. 10 shows a set of query completion tables which contain entries 911corresponding to the historical query “hot potato”. This example assumesa chunk size, C, equal to four. In other embodiments the chunk size maybe 2, 3, 5, 6, 7, 8, or any other suitable value. The chunk size, C, maybe selected based on empirical information. The first three of the querycompletion tables shown in FIG. 10, 516-1 through 516-3, are for thepartial queries “h”, “ho” and “hot”, respectively. The next two querycompletion tables, 910-1 and 910-2 correspond to the partial queries“hot pot” and “hot potato”, respectively, having partial query lengthsof 7 and 10. Referring back to step 710 of FIG. 7, with each iterationof the loop formed in part by step 710, the length of the partialqueries initially increases by steps of 1 character, until a length ofC-1 is reached, and then the length of the partial queries increases bysteps of C characters, until the full length of the historical query isreached.

The entries 911 of each chunked query completion table are orderedaccording to the ranking values (represented by scores 916) of the querystrings identified by the query completion fingerprints 912 in theentries 911. For partial queries having less than C characters, thenumber of queries in the associated query completion table 516 is afirst value (e.g., 10 or 20), which may represent the number of queriesto return as predictions. In some embodiments, the maximum number (e.g.,a number between 1000 and 10,000) of entries 911 in each chunked querycompletion table 910 is significantly greater than the first value. Eachchunked query completion table 910 may take the place of dozens orhundreds of ordinary query completion tables. Therefore, each chunkedquery completion table 910 is sized so as to contain a number (p) ofentries corresponding to all or almost all of the authorized historicalqueries having a prefix portion that corresponds to the chunked querycompletion table, while not being so long as to cause an undue latencyin generating a list of predicted queries for a user specified partialquery.

After the query completion tables 516, 910 and fingerprint-to-table maps510 have been generated from a set of historical queries, these samedata structures (or copies thereof) are used for identify a predictedset of queries corresponding to a user entered partial query. As shownin FIG. 9, the user entered partial query is first mapped to a queryfingerprint 515 or 908, by applying a hash function 514 either to theentire partial query 902 or to a prefix portion 904 of the partialquery, as determined by the length of the partial query. The queryfingerprint 515 or 904 is then mapped to a query completion table 516 or910 by performing a lookup of the query fingerprint in afingerprint-to-table map 510. Finally, an ordered set of up to Npredicted queries is extracted from the identified query completiontable. When the length of the partial query is less than the chunk size,the ordered set of predicted queries are the top N queries in theidentified query completion table. When the length of the partial queryis equal to or longer than the chunk size, the identified querycompletion table is searched for the top N items that match the suffixof the partial query. Since the entries in the query completion table910 are ordered in decreasing rank, the process of searching formatching entries begins at the top and continues until the desirednumber (N) of predictions to return is obtained (e.g., 10) or until theend of the query completion table 910 is reached. A “match” exists whenthe suffix 906 of the partial query is the same as the correspondingportion of the suffix 914 in an entry 911. For instance, referring toFIG. 10, a one letter suffix of <p> matches entries 911-3 and 911-4having suffixes of <pot> and <pal>, respectively. An empty suffix (alsocalled a null string) having length zero matches all entries in a querycompletion table, and therefore when the suffix portion of a partialquery is a null string, the top N items in the table are returned as thepredicted queries.

As noted above, the data structures and processes for identifying anordered set of predicted URLs that correspond to a partial URL are thesame as the data structures and processes, described above, foridentifying an ordered set of predicted queries that correspond to auser entered partial query. Even though URLs and query strings may havedifferent uses, both may be treated as a string of characters or symbolswhose value may be predicted after partial entry by a user. In someembodiments, the set of “historical URLs” from which a set of URLcompletion tables 1234 (FIG. 12) and URL fingerprint-to-table maps 1236(FIG. 12) are built may comprise URLs entered by a particular user or aset or community of users. In another embodiment, the set of “historicalURLs” from which a set of URL completion tables and URL fingerprint totable maps are built may comprise the URLs of documents stored in adocument database, such as the document database of a search engine.

FIG. 11 illustrates a user's view when using a browser and toolbaraccording to some embodiments of the invention. A browser 1102 includesa toolbar 1104 including a text entry box 1106 depicting the entry of apartial query <hot>. In response to detecting the partial query andultimately receiving the predicted queries from the query server, thepredictions are displayed in display area 1108 for possible selection bythe user. Similarly, while not shown, in response to detecting userentry of a partial URL in an address bar 1110, an ordered set ofpredicted URLs may be displayed in a display area (not shown)immediately below or adjacent the address bar 1110 for possibleselection by the user.

Referring to FIG. 12, an embodiment of a search engine 1202 thatimplements the methods and data structures described above includes oneor more processing units (CPU's) 1204, one or more network or othercommunications interfaces 1206, a memory 1208, and one or morecommunication buses 1210 for interconnecting these components. Thesearch engine 1202 may optionally include a user interface 1212comprising a display device 1214 and a keyboard 1216. The memory 1208may include high speed random access memory and may also includenon-volatile memory, such as one or more magnetic or optical storagedisks. The memory 1208 may include mass storage that is remotely locatedfrom CPU's 1204. The memory 1208 may store the following elements, or asubset of such elements:

-   -   an operating system 1218 that includes procedures for handling        various basic system services and for performing hardware        dependent tasks;    -   a network communication module (or instructions) 1220 that is        used for connecting the search engine 1202 to other computers        via the one or more communications interfaces 1206 (wired or        wireless), such as the Internet, other wide area networks, local        area networks, metropolitan area networks, and so on;    -   a query server 210 for receiving full or partial queries and        returning search results and predicted queries and predicted        search results; and    -   a prediction server 212 for receiving a partial query and        returning a set of ordered predictions of queries or URLs.

In some embodiments, the query server 210 includes the followingelements, or a subset of such elements: a client communications module216 for receiving and transmitting information; a query receipt,processing and response module 218 for receiving and responding to fullsearch queries; a partial query receipt, processing and response module220 for receiving and responding to full search queries; a userinformation and processing module 222 for accessing user informationfrom a user information database 1226, which includes respective userprofiles 1228 for a plurality of users; a query log 224 for storinginformation about previously submitted queries, and a URL log ordatabase 225. In some embodiments, the query server 210 includes asubset of these modules. In some embodiments, the query server 210includes additional modules.

In some embodiments, the prediction server 212 includes the followingelements, or a subset of such elements:

-   -   a query receiving module (or instructions) 1230 for receiving a        partial query;    -   a query/URL completion table builder (or instructions) 1232 for        generating query completion tables 516, 910 and query        fingerprint-to-table maps 510; in some embodiments, the        query/URL completion table builder 1232 may also generate URL        completion tables 1234 and URL fingerprint-to-table maps 1236;        and    -   a prediction module (or instructions) 1238 for obtaining a set        of predicted queries or URLs.

In some embodiments, the prediction server 212 may also include one ormore of the following:

-   -   a personalization module (or instructions) 1240 for selecting        the set of predicted queries based, at least in part, on certain        user profile information;    -   a concept module (or instructions) 1242 for determining the        concepts associated with a particular query;    -   a community characteristics module (or instructions) 1244 for        determining a set of characteristics associated with a community        of users; and    -   a spelling module (or instructions) 1246 for identifying        alternative spellings of a received query or query term.

In some embodiments, one or more of the user information processingmodule 222, personalization module 1240, concept module 1242, communitycharacteristics module 1244 and spell module 1246 are not implemented.When implemented, the user profiles 1228 of the user informationprocessing module 222 may contain information suitable for selecting orordering predicted queries or URLs. For instance, a user profile 1228may identify categories of information that are of interest to aparticular user. A user profile 1228 may also contain informationassociated with a community of users to which a user belongs or withwhich the user is associated. The user information processing module 222may merge personal information with the community information togenerate a user profile 1228.

When implemented, the concept module 1242 may map historical queries toconcepts or categories of information, suitable for matching with theinformation in a user profile 1228. Similarly, the concept module 1242may be configured to map historical URLs to concepts or categories ofinformation, for instance by determining a set of primary concepts,subjects or categories of information in the content of the documentscorresponding to the historical URLs. The concept, subject or categoryinformation identified by the concept module 1242 may be stored in theentries of the query completion tables or URL completion tables, or inthe query records or URL records identified by the query/URL completiontables. When processing a partial query or URL, the set of predictedqueries or URLs may be reordered so that the predicted queries or URLswhose concept, subject or category information matches the informationin the user profile of the requesting user are placed higher in the listof predicted queries or URLs than those predicted queries or URLs whoseconcept or category information does not match the information in theuser profile of the requesting user.

In another embodiment, the concept module 1242 may be configured to mapone or more terms in a partial query to one or more substitute terms inaccordance with a conceptual or category mapping of those terms. Anordered set of predicted queries are generated for a partial querycontaining the one or more substitute terms, and those predicted queriesare then transmitted to the user, either separately or merged with theresults produced using the partial query as entered by the user.

FIG. 12 depicts the internal structure of a search engine 1202 in oneembodiment. It should be understood that in some other embodiments thesearch engine 1202 may be implemented using multiple servers so as toimprove its throughput and reliability. For instance the query log 224could be implemented on a distinct server that communications with andworks in conjunction with other ones of the servers in the search engine1202.

Although the discussion herein has been made with reference to a searchengine designed for use with documents remotely located from the searchrequestor, it should be understood that the concepts disclosed hereinare equally applicable to other search environments. For example, thesame techniques described herein could apply to queries against any typeof information repository against which queries, or searches, are run(e.g., an address book, a product information database, a file server, aweb site and so on). Accordingly, the term “search engine” should bebroadly construed to encompass all such uses.

Referring to FIG. 13, an embodiment of a client system 1300 thatimplements the methods described above includes one or more processingunits (CPU's) 1302, one or more network or other communicationsinterfaces 1304, memory 1306, and one or more communication buses 1308for interconnecting these components. The search engine 1300 mayoptionally include a user interface 1310 comprising a display device1312 and/or a keyboard 1314. Memory 1306 may include high speed randomaccess memory and may also include non-volatile memory, such as one ormore magnetic or optical storage disks. The memory 1306 may include massstorage that is remotely located from CPU's 1302. The memory 1306 maystore:

-   -   an operating system 1316 that includes procedures for handling        various basic system services and for performing hardware        dependent tasks;    -   a network communication module (or instructions) 1318 that is        used for connecting the client system 1300 to other computers        via the one or more communications network interfaces 1304 and        one or more communications networks, such as the Internet, other        wide area networks, local area networks, metropolitan area        networks, and so on; and    -   a browser or tool 1320 for interfacing with a user to input        search queries, and for displaying search results; and    -   a search assistant 1322.

In some embodiments, the search assistant 1322 is separate from thebrowser/tool 1320, while in other embodiments the search assistant isincorporated in the browser/tool 1320.

The search assistant 1322 may include the following elements, or asubset of such elements: an entry and selection monitoring module (orinstructions) 1324 for monitoring the entry of search queries andselecting partial queries for transmission to the search engine; atransmission module (or instructions) 1326 for transmitting partialsearch queries and final search queries to the search engine; apredicted query receipt module (or instructions) 1328 for receivingpredicted queries; a predicted search results receipt module (orinstructions) 1330 for receiving predicted search results; displaymodule (or instructions) 1332 for displaying predictions and results;and optionally, a search results receipt module (or instructions) 1334for receiving search results. The transmission of final (i.e.,completed) queries, receiving search results for completed queries, anddisplaying such results may be handled by the browser/tool 1320, thesearch assistant 1322, or a combination thereof. The search assistant1322 may also provide a corresponding set of functions for handlingpartial and complete URLs, which may be handled by either the sameelements or a parallel set of elements as those described above.

Although illustrated in FIGS. 12 and 13 as distinct modules orcomponents, the various modules or components may be located orco-located within either the search engine or the client. For example,in some embodiments, portions of prediction server 212, and/or thevarious query completion tables 516 and/or 910 are resident on theclient system 202 or form part of the search assistant 204. For example,in some embodiments query completion tables and fingerprint-to-tablemaps for the most popular searches may be periodically downloaded to aclient system 202, thereby providing fully client-based query or URLinput prediction for at least some partially input queries or URLs.

In another embodiment, the search assistant 204 may include a localversion of the prediction server 212, for making search or URLpredictions based at least in part on prior searches and URL entries ofthe user. Alternately, or in addition, the local prediction server 212may generate predictions based on data downloaded from a search engineor remote prediction server. Further, the client assistant 204 may merge(e.g., by interleaving) locally generated and remotely generatedprediction sets for presentation to the user.

Although some of various drawings illustrate a number of logical stagesin a particular order, stages which are not order dependent may bereordered and other stages may be combined or broken out. While somereordering or other groupings are specifically mentioned, others will beobvious to those of ordinary skill in the art and so do not present anexhaustive list of alternatives. Moreover, it should be recognized thatthe stages could be implemented in hardware, firmware, software or anycombination thereof.

The foregoing description, for purpose of explanation, has beendescribed with reference to specific embodiments. However, theillustrative discussions above are not intended to be exhaustive or tolimit the invention to the precise forms disclosed. Many modificationsand variations are possible in view of the above teachings. Theembodiments were chosen and described in order to best explain theprinciples of the invention and its practical applications, to therebyenable others skilled in the art to best utilize the invention andvarious embodiments with various modifications as are suited to theparticular use contemplated.

1. A method for processing URL information, comprising: at a serversystem: receiving from a requester a partial URL; the receivingincluding receiving the partial URL from the requester prior to therequester performing an action indicating entry of a complete URL thatincludes the partial URL; obtaining a set of complete URLs correspondingto the partial URL, the complete URLs previously submitted by acommunity of users and ordered in accordance with a ranking criteria;and conveying the set of ordered complete URLs to the requester.
 2. Themethod of claim 1, including, at the server system, responding toreceiving the partial URL by obtaining the set of complete URLs,ordering the set of complete URLs in accordance with the rankingcriteria, and conveying the set of ordered URLs to the requestor.
 3. Themethod of claim 1, wherein the set of complete URLs is selected from aplurality of ordered sets of complete URLs.
 4. The method of claim 1,wherein the obtaining includes identifying two or more sets of completeURLs corresponding to the partial URL, and merging the sets of completeURLs in an interleaved fashion.
 5. The method of claim 1, wherein thecomplete URLs are ordered in accordance with an importance factorassociated with each complete URL.
 6. The method of claim 1, wherein thecomplete URLs are ordered by a frequency of submission associated witheach complete URL.
 7. The method of claim 1, including: receiving a userselection of a complete URL from among the set of ordered complete URLs;and requesting a resource associated with the user selected completeURL.
 8. The method of claim 1, including: receiving input identifying aselected complete URL from among the set of ordered complete URLs; andrequesting a resource associated with the selected complete URL.
 9. Themethod of claim 1, wherein the obtaining includes ordering the completeURLs in accordance with a time/date value of the complete URL.
 10. Themethod of claim 1, wherein the obtaining includes identifying at leastone complete URL in accordance with a modified spelling of the partialURL.
 11. The method of claim 1, further including modifying the setbased on characteristics associated with a user.
 12. The method of claim1, further including: sending the set to the requester; determining thatno selection corresponding to a sent URL in the set is received from therequester within a predetermined time period; and sending to therequestor a subsequent set of complete URLs ordered in accordance withthe ranking criteria.
 13. A computer-implemented program product, foruse with a computer system, the computer-implemented program productcomprising: one or more computer programs stored on a computer readablestorage medium, which when executed cause the computer system to:receive from a requestor a partial URL; wherein the partial URL isreceived from the requester prior to the requester performing an actionindicating entry of a complete URL that includes the partial URL; obtaina set of complete URLs previously submitted by a community of users, thecomplete URLs corresponding to the partial URL and ordered in accordancewith a ranking criteria; and convey the set of ordered complete URLs tothe requester.
 14. The computer-implemented program product of claim 13,wherein the one or more computer programs include instructions stored onthe computer readable storage medium, which when executed cause thecomputer system to: respond to receiving the partial URL by obtainingthe set of complete URLs, ordering the set of complete URLs inaccordance with the ranking criteria, and conveying the set of orderedURLs to the requestor.
 15. The computer-implemented program product ofclaim 13, wherein the one or more computer programs include instructionsstored on the computer readable storage medium, which when executedcause the computer system to: order the set of complete URLs inaccordance with a frequency of submission of each complete URL in theset of complete URLs.
 16. The computer-implemented program product ofclaim 13, wherein the one or more computer programs include instructionsstored on the computer readable storage medium, which when executedcause the computer system to: order the set of complete URLs inaccordance with an importance factor of each complete URL in the set ofcomplete URLs.
 17. The computer-implemented program product of claim 13,wherein the one or more computer programs include instructions stored onthe computer readable storage medium, which when executed cause thecomputer system to order the complete URLs in accordance with atime/date value of the complete URLs.
 18. The computer-implementedprogram product of claim 13, wherein the one or more computer programsinclude instructions stored on the computer readable storage medium,which when executed cause the computer system to identify at least onecomplete URL in accordance with a modified spelling of the partial URL.19. The computer-implemented program product of claim 13, wherein theone or more computer programs further include instructions stored on thecomputer readable storage medium, which when executed cause the computersystem to modify the set based on characteristics associated with therequestor.
 20. The computer-implemented program product of claim 13,wherein the one or more computer programs further include instructionsstored on the computer readable storage medium, which when executedcause the computer system: to send the set to the requestor; todetermine that no selection corresponding to a sent complete URL in theset is received from the requestor within a predetermined time period;and to send to the search requester a subsequent set of complete URLsordered in accordance with the ranking criteria.
 21. Thecomputer-implemented program product of claim 13, wherein the one ormore computer programs include instructions for selecting the respectiveset of complete URLs from a plurality of ordered sets of complete URLs.22. The computer-implemented program product of claim 13, wherein theone or more computer programs include instructions for identifying twoor more sets of complete URLs corresponding to the partial URL, andmerging the sets or complete URLs in an interleaved fashion.
 23. Asystem for processing query information, comprising: one or more centralprocessing units for executing programs; and memory to store data and tostore programs to be executed by the one or more central processingunits, the memory storing: sets of complete URLs previously submitted bya community of users; instructions for receiving a partial URL from arequestor; wherein the partial URL is received from the requester priorto the requester performing an action indicating entry of a complete URLthat includes the partial URL; instructions for obtaining a respectiveset of complete URLs previously submitted by a community of users, thecomplete URLs corresponding to the partial URL and ordered in accordancewith a ranking criteria; and instructions for conveying the respectiveset of ordered complete URLs to the requestor.
 24. The system of claim23, wherein the instructions for obtaining the respective set ofcomplete URLs include instructions for selecting the respective set ofcomplete URLs from a plurality of ordered sets of complete URLs.
 25. Thesystem of claim 23, wherein the instructions for obtaining therespective set of complete URLs include instructions for identifying twoor more sets of complete URLs corresponding to the partial URL, andmerging the sets or complete URLs in an interleaved fashion.
 26. Thesystem of claim 23, wherein the memory stores instructions forresponding to receiving the partial URL by obtaining the set of completeURLs, ordering the set of complete URLs in accordance with the rankingcriteria, and conveying the set of ordered URLs to the requestor. 27.The system of claim 23, wherein the instructions for obtaining therespective set of complete URLs include instructions for ordering therespective set of complete URLs in accordance with a frequency ofsubmission of each complete URL in the respective set of complete URLs.28. The system of claim 23, wherein the instructions for obtaining therespective set of complete URLs include instructions for ordering therespective set of complete URLS in accordance with an importance factorof each complete URL in the respective set of complete URLs.
 29. Thesystem of claim 23, wherein the instructions for obtaining therespective set of complete URLs include instructions for ordering therespective set of complete URLS in accordance with a time/date value ofthe complete URLs.
 30. The system of claim 23, wherein the instructionsfor obtaining the respective set of complete URLs include instructionsfor identifying at least one complete URL in accordance with a modifiedspelling of the partial URL.
 31. The system of claim 23, wherein theinstructions for obtaining the respective set of complete URLs includeinstructions for modifying the respective set of complete URLs based oncharacteristics associated with the requestor.
 32. The system of claim23, wherein the memory stores instructions for: sending the respectiveset of complete URLs to the requestor; determining that no selectioncorresponding to a sent complete URL in the set is received from therequestor within a predetermined time period; and sending to the searchrequestor a subsequent set of complete URLs ordered in accordance withthe ranking criteria.